OPTIMAL COUPLING FOR MEAN FIELD LIMITS 
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Abstract. We review recent quantitative results on the approximation of mean field dif- 
fusion equations by large systems of interacting particles, obtained by optimal coupling 
methods. These results concern a larger range of models, more precise senses of convergence 
and links with the long time behaviour of the systems to be considered. 



Let us consider a Borel probability distribution f t = ft(X) on M. d evolving according to the 
McKean-Vlasov equation 

i,j=X i=l 

Here a = {(Hj)x<ij<d is a nonnegative symmetric d x d matrix; moreover, given X in M. d 
and a Borel probability measure p on 

bi[X,p}= [ bi{X,Y)dp(Y), l<i<d 

where b(X,Y) = (bi(X, ^))i<i<d is a vector of M. d . Equation ([I]) has the following natural 
probabilistic interpretation: if /q is a distribution on the solution ft of (pQ) is the law 
at time t of the Revalued process (Xt)t>o evolving according to the mean field stochastic 
differential equation 

dX t = adB t -b[X t ,f t ]dt. (2) 
Here the d x d matrix a satisfies aa* = 2a, {Bt)t>o is a Brownian motion in M. d and ft is the 
law of X t in R d . It is a mean field equation in the sense that the evolution of Xt is obtained 
by averaging the contributions b(Xt,Y) over the system, according to the distribution dft(Y). 
Existence and uniqueness of solutions to (pQ) and ([2]) are proven in for globally Lipschitz 
drifts b and initial data /o with finite second moment. Non globally Lipschitz drifts are 
discussed in section [Q 

Two instances of such evolutions are particularly interesting. First of all, when M. d is the 
phase space of positions x € M. d and velocities v € with d = 2d', one is interested in the 
Vlasov-Fokker-Planck equation 

^+v-V x ft-{V x U* xP [ft))-V v f t = A v f t + V v -((A(v)+B(x))f t ), t > 0, x,veR d '. (3) 

Here a ■ b denotes the scalar product of two vectors a and b in whereas V„, and 
A„ respectively stand for the gradient, divergence and Laplace operators with respect to the 

velocity variable v £ Moreover p[ft](x) = / ft{x, v) dv is the macroscopic density in the 
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space of positions x € M d ' , or the space marginal of ft; U = U(x) is an interaction potential 
in the position space and * x stands for the convolution with respect to the position variable 
x £ M. d ; finally A(v) and B(x) are respectively friction and position confinement terms. This 
equation is used in the modelling of diffusive stellar matter (see [8] for instance) . 
We are also concerned with the space homogeneous equation 

^ = A v f t + V v -((V v V + V v W* v f t )ft), t>Q,veR d (4) 

with d = d! . Here V and W are respectively exterior and interaction potentials in the veloc- 
ity space, and this equation is used in the modelling of space homogeneous granular media 
(see[2j). 

The particle approximation of (H|) consists in introducing N processes (X l t ,N ) t >o, with 
1 < i < N, which evolve no more according to the drift b[Xt, ft] generated by the distribution 
ft as in ([2]), but according to its discrete counterpart, namely the empirical measure 

N 

\N _ 

N 



11 < T^'*' 



of the particle system [X] ,N , . . . ,X^' N ). In other words we let the processes {X l t ' N ) t >o solve 

1 N 

dX\> N = adB\--Y, b(Xl' N ,Xl N ) dt, 1 < i < N. (5) 
i=l 

Here the (i?£)t>o's are iV independent standard Brownian motions on M rf and we assume that 
the initial data Xq N for 1 < % < N are independent variables with given law /o. 

The mean field force b[X t , ft] in ([2]) is replaced in ([5]) by the pairwise actions — b(X l t ,N , X{' N ) 

of particle j on particle i. In particular, even in this case when the initial data Xq N are inde- 
pendent, the particles get correlated at all t > 0. But, since this interaction is of order 1/N, 
it may be reasonable that two of these interacting particles (or a fixed number k of them) 
become less and less correlated as N gets large. 

In order to state this propagation of chaos property we let, for each i > 1, (Xl) t >o be the 
solution of 

jdXi = adBl-b[XlJ t ]dt 

where ft is the distribution of X\. The processes (JQ) 4 >o with i > 1 are independent since 
the initial conditions and driving Brownian motions are independent. Moreover they are 
identically distributed and their common law at time t evolves according to (pQ), so is the 
solution ft of (H]) with initial datum /o. In this notation, and as N gets large, we expect the 
N processes (Xl ,N ) t >o to look more and more like the N independent processes (Xj)t>o'- 

Theorem [12]) If b is a Lipschitz map on R 2rf and /o a Borel distribution on W 1 with 

finite second moment, then, in the above notation, for all t > there exists a constant C such 
that 



it ~ N 



C 



for all N. 



This results also holds in a more general setting when the diffusion matrix can depend on 
X and on the distribution in a Lipschitz way, and can be stated at the level of the paths of 
the processes on finite time intervals. 

First of all, it ensures that the common law f^ ,N of any (by exchangeability) of the particles 
X\' converges to ft as N goes to infinity, according to 

Wl(fl> N J t )<nxi' N -Xtf<^- (7) 

Here the Wasserstein distance of order p > 1 between two Borel probability measures [i and 
v on W 1 with finite moment of order p > 1 is defined by 



W p ((i,v) =inf (E\X-Y 



ip\Vp 



where the infimum runs over all couples of random variables (X, Y) with X having law fx and 
Y having law v (see for instance). 

Moreover, it proves a quantitative version of propagation of chaos : for all fixed k, the law 
ft' N of any (by exchangeability) k particles X l t ' N converges to the product tensor (ft)® k as 
N goes to infinity, according to 

W2{ft*> (/*)**) < n(Xl' N , X*' N ) - (Xl ... , X^ < ^. 

It finally gives the following first result on the convergence of the empirical measure jlf of 
the particle system to the distribution ft- if ip is a Lipschitz map on then 

N „ N 

I 

E 



\Y,v{Xi> N )-f ^dft 2 <2E|v,(X^)-^)| 2 +2e|1^^)- / Vdft 2 <^ (8) 
by the Theorem and a law of large numbers argument on the independent variables X\. 



Recent attention has been brought to improve these classical results in three directions: 

1. enlarging the setting to non Lipschitz drift terms; 

2. providing more precise estimates on the approximation of the solution to ([T]) by the 
empirical measure of the particle system; 

3. providing time uniform estimates when possible, in connexion with the long time be- 
haviour of the solutions. 



1. Non Lipschitz drifts 

The interest for such mean field limits with locally but non globally Lipschitz drifts has 
been renewed by kinetic models, in which the interaction may become larger for larger relative 
velocities. 

For instance, for space homogeneous models such as with W(z) = \z\ 3 and d = 1, the 
difficulty brought by the non Lipschitz drift has been solved by convexity arguments, first in 
dimension one in [I], then more generally in any dimension in [7], [9]. 

Space inhomogeneous collective behaviour models are under study in [3], for which known 
convexity arguments do not apply. 
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2. Deviation bounds for the empirical measure 



The averaged estimate ([8]) ensures that the particle system is an appropriate approximation 
to solutions to ([I]). However, when the particle system is used for numerical simulations, one 
may wish to establish estimates making sure that the numerical method has a very small 
probability to give wrong results. 

This was achieved in [3J, [7J, [9], [TO] by the use of (Talagrand) transportation inequalities. 
It is proved in these works, under diverse hypotheses on the initial data and in diverse contexts, 
that for all t the law f 1 



N,N 



for all measures v on 



r,dN 



of the particle system at time t satisfies a transportation inequality 

c 

Here c may depend on t (but neither on N nor on v), and 



w 1 (uj t N 'y<-H(u\fr") 



N,N\ 



du 



df t 



N,N 



In- 



du 



dft 



N,N 



df] 



N.N 



is the relative entropy of v with respect to f^' N , to be interpreted as +oo if v is not absolutely 
continuous with respect to fl^' ■ Then an argument by S. Bobkov and F. Gotze, based on 
the Kantorovich-Rubinstein dual formulation 



Wi(i/,f t ) = sup 



$du 



<$>dfj*' N , $ 1-Lipschitz on 



t>dN 



(9) 



and the dual formulation of the entropy, ensures that 

N 



i=i 



fdf, 



1,N 



> r 



< e 



-cNr 2 



for all N > l,r > and all 1-Lipschitz maps (p on 
Since moreover 

N 

fdft 



(see [13} Chapter 22] for instance). 



i-X>(** ,iV )- / M < |^f>(^)-/ 

JV 



<pdf. 



1,JV 



+ W 1 (ti' N ,f t ) 



< 



by ©, ([7]) and the bound W\ < W2, this ensures one-observable error bounds like 



1 1 * 

I N 

i=l 



> 



C 

N +r 



<2e 



-cNr 2 



(10) 



for all N, r and all 1-Lipschitz maps ip on M. d . In this argument we see how well adapted to 
this issue are the Wasserstein distances: from their definition they can easily be bounded from 
above by simple estimates on the processes, as in (J7J), and in turn they lead to straightforward 
estimates on Lipschitz observables, by Q. 

Uniformly on Lipschitz observables, and for the Wasserstein distance W\ which, up to 
moment conditions, metrizes the narrow topology on measures, estimates like 

-XNr 2 



W x {$J t )>r 



< e 



(11) 
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or even 



sup Wi(/if ,f t ) > r 

0<t<T 



< e ~ cNr2 



were reached in [5], provided A" is larger than an explicit iVo(r), hence ensuring that the 
probability of observing any significant deviation during a whole time period [0, T] is small. 
Also bounds were obtained on the pointwise deviation of a mollified empirical measure around 
the solution f t . 

3. Time uniform estimates and long time behaviour 

Under convexity assumptions on the potentials V and W, the solution to the space homoge- 
neous granular media equation Q has been proven to converge algebraically or exponentially 
fast to a unique steady state (see [6] for an entropy dissipation proof based on interpreting @ 
as a gradient flow in the Wasserstein space, and also [7], [9], |10j). In this setting one can hope 
for time uniform constants in estimates (f8j)- (|10p - (|lip . which were obtained in [5], [7J, [9], |10| . 

Also convergence to equilibrium (through a contraction argument in W% distance) and time 
uniform deviation bounds were obtained in [lj for solutions to the (now space inhomogeneous) 
Vlasov-Fokker-Planck equation ([3]). 
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